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x -> f(x - b) as b ranges over R n . It is not too difficult to show (see, for example, [I], or [2]), that 
/is an exponential polynomial if and only if Uf is finite dimensional. So, if X f is finite dimensional, 
as Uf c X fi f is an exponential polynomial. Now if / contains any exponential exp (a • x ) with 
a # 0 in R n , we would find that since X f contains dilations as well as translations, X f would be of 
infinite dimension. Hence a = 0 and so / is a polynomial. 

We conclude by noting a related result of Loewner [2] that if n > 1 and if / e C(R n ) is such 
that its transforms under all isometries of R n span a finite dimensional space, then / is a 
polynomial. Thus, if R f denotes the subspace of C(R n ) spanned by all translations of / and all 
orthogonal transformations O p f: x -> f(Px ), where P is any orthogonal real matrix, the following 
holds. 

Theorem 3. Let n > 1 and /e C(R n ). A necessary and sufficient condition that f be a 
polynomial is that Rf be finite dimensional. 

This theorem was first shown by Loewner [2] using exponential polynomials. Whether it can 
also be shown by methods similar to those used in the main proof of our Theorem 2 is an 
interesting conjecture to us. 

Acknowledgements. The authors thank Dr. R. Nillsen of the University of Wollongong, and Professor J. Wilker 
of the University of Toronto for their helpful discussions. 
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POISSON APPROXIMATIONS AND THE DEFINITION OF THE POISSON 

PROCESS 

TIMOTHY C. BROWN 

Department of Mathematics, Monash University, Clayton, Victoria 3168, Australia 

1. Introduction. This article has two purposes. The first is to expose two interesting, connected 
facts about the Poisson distribution, both of which deserve to be better known. The first of these 
is that it is almost as easy to provide error bounds for certain Poisson distribution approximations 
as it is to derive these approximations via convergence arguments. The second one is that these 
bounds make it extremely easy to prove an elegant qualitative characterization of the Poisson 
process, due to Prekopa [20]. 

The second aim of the article is to advertise two important general ideas in probability theory. 
Both ideas have been extremely useful in recent research, but authors of elementary textbooks 
usually consider these ideas to be too advanced for inclusion. On the other hand, the exposition 
here only requires that the reader know some undergraduate real analysis and the most trivial 
facts about probability. Thus, I hope that the simple examples here might make the ideas more 
widely known. 

Coupling , a technique introduced by Doeblin [5], is one basic idea in this article. By this we 
mean the study of one or two specific distributions, via the construction of two random variables 
(or vectors or processes). The coupled random variables are designed so that not only do they have 


Timothy C. Brown: I did my Ph.D. at Cambridge University under G. K. Eagleson. After three years as a lecturer 
at the University of Bath, U.K., I returned to my native land in 1981. My chief extramathematical interests are 
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specified ( marginal ) distributional properties, but also it is possible to analyse their joint 
( point-by-point ) behaviour. We shall use coupling in Section 2 to provide the error bounds. A 
(random!) sample of other areas in which coupling has proved fruitful are: the convergence of 
transition probabilities of Markov chains [5], [19], the renewal theorem [15], interacting particle 
systems [23], the F. K. G. inequality of statistical physics [10], simulation algorithms for spatial 
processes [16], random walks on finite groups, rapidly mixing Markov chains and the number of 
shuffles needed to thoroughly mix a pack of cards [2], and the tail behaviour of birth-and-death 
and stochastically monotone processes [1]. For an excellent survey of coupling methods for 
Markov processes, the reader may like to consult [8]. 

The other central idea here is that the path properties of stochastic processes are crucial in their 
definition. As an example, let us consider Brownian motion. In the simplest case, this is a model 
for the position of a particle executing a “random” motion in one dimension. In introductory 
courses, the model is often introduced as a stochastic process with certain joint normal distribu- 
tions for the positions of the particle at different times. However, it seems initially that this 
definition hardly corresponds at all to the physical idea of a particles motion. On the other hand, 
the following model assumptions are easily interpretable: 

(a) each path of the particle is continuous, 

(b) the path of the particle in the time interval [0, /] is independent of the path in [t 9 1 + s] 9 
apart from the common position at time t, and 

(c) the mean position at time t is 0, while the variance of the position is t. 

It is a consequence of a celebrated result of Levy [14, p. 78] that precisely formulated versions of 
(a), (b), and (c) are enough to guarantee that the model is Brownian motion. More recently, the 
profound work of Stroock and Varadhan [22] has showed that characterizations like Levy’s can be 
extended to a large class of processes satisfying (a). As a result, some open questions on the 
existence and uniqueness of solutions of partial differential equations were settled. In Section 3 we 
show that it is very simple to prove the characterization of the ordinary Poisson process that is 
analogous to (a), (b) and (c). That proof is then easily modified to yield the much more general 
characterization of Prekopa [20]. 

2. Poisson Approximations. In this section, we give some results about the error in approxi- 
mating the distribution of certain random variables by Poisson distributions. Let us first recall 
that the Poisson distribution with parameter X ^ 0 attributes probability 

P(j\ X) = E 

j^J 


to the set / c (0, 1, 2, ... }. 

Suppose that A lf . . . ,A n are independent events in a probability space (R, ^,P). For example, 
in a random sample taken with replacement, A l might represent the event that the i th person 
sampled has a certain disease. The random variable S , which gives the number of A l9 ...,A n that 
occur, is often of interest; in the example, this would be the number in the sample who have the 
disease. Formally, 

(1) S=t IA t 

i=l 

where IA t is the indicator (or characteristic) random variable for the event A i (that is, IA, ( co) = 1 
if co e A i9 and Z4 y (co) = 0 otherwise). If the probabilities 

Pi F( A i ) , i 1 > • • • > n , 

are all the same, then S has a binomial distribution. On the other hand, if the p's vary, then the 
distribution is much more complicated. However, in both cases the distribution of S can be 
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approximated by a Poisson distribution in certain circumstances; intuitively these are when each 
event is “rare,” i.e., each p { is small. More precisely, we have the following theorem due to Le Cam 
[13]. 

Theorem 1. For any set J c {0, 1, 2, ... }, 

|P(S e I <E/>, 2 

where here , and for the rest of this section , unmarked sums are over i = 1, . . . , n. 

We present a modification of a proof of Theorem 1 due to Serfling [21]. Here is the key tool: 

The Coupling Lemma. If X and Y are any random variables and J is any ( Borel ) set of R, then 
\P(X e/)-P(7e/)|< P( X * 7). 

The Coupling Lemma is in itself trivial, as you can see from the proof. However, the quantities 
in Theorem 1 are very difficult to bound analytically and the power of the Coupling Lemma is 
that it turns this analytic problem into one of probabilistic construction. For, provided X is any 
random variable with the same distribution as S, and Y is any random variable with Poisson (E p t ) 
distribution, the Coupling Lemma tells us that P(X ¥> Y) is a bound for the left side in Theorem 
1. The key point here is that the joint law of X and Y can be arbitrary, subject only to their having 
the correct marginals. As in Euclidean geometry, clever construction is just what the doctor 
ordered! An advantage of the proof of Theorem 1 as an introduction to coupling is that, in 
contrast to many other applications of coupling, the required coupling is quite straightforward. 

Proof of Coupling Lemma. Without loss of generality P(I g /) > P(7 e J). In this case, the 
required modulus is bounded by 

P(Ig/)-P(Ig/ and 7 g/) = P(Ig/ and Y<£J), 
and the latter is clearly bounded by J*(X # Y). 

The coupling we present relies on a fundamental construction in probability. This is the 
construction showing that a probability space which supports a uniform random variable also has 
one of any other distribution. It is also important for the computer simulation of random 
variables; usually the computer has an algorithm which produces uniform random variables and 
the construction can (at least in principle) then turn these into any desired distribution. We 
describe the construction only in the simpler discrete case and refer the interested reader to [3, 
p. 159] for the general case. Suppose that U is a uniform random variable over (0,1) (i.e., its 
distribution is Lebesgue measure on (0, 1)). Suppose also that the desired distribution Q attributes 
probability q(i) to { / } ( e (0, 1, 2, . . . }). For x e (0, 1), let 

Q(x ) = i 

for the unique i e {0, 1, 2,. . . } such that 

£ q(j) < x < £ q(j), 

7=0 7 = 0 

where the sum on the left with i = 0 is by convention 0. The constructed random variable is then 
Q(U). It is immediate that Q(U) does have distribution Q, because for i e {0, 1, 2, . . . }, 

£?(./)> t 

\j-o 7-0 

Recall that we wish to construct random variables X and Y with prescribed distributions, say Q 
and R, so that J*(X =£ Y) is as small as possible. The construction above gives us an obvious 
attempt for such similar random variables. We simply take a single uniform random variable U 


p (Q(u) = i) = p 
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and set X = Q(U ) and Y = R(U). Unfortunately, P(X # Y) is generally still hard to calculate if 
we use such X and Y in Theorem 1. However, for n = 1 the calculation is easy. In this case, let B 
be the distribution of S; B is said to be a Bernoulli distribution with parameter p = p v Let P 
denote the Poisson distribution with parameter p. Then, P(B(U) ¥= P(U) is the length of the 
dotted intervals in Fig. 1, and this is 

(e~ p ~( 1 ~p)) +(1 - e~ p {\ +p))<p 2 , 

as required, since 

( 2 ) 1 -P<e- p , 

an inequality which also justifies the crucial ordering of the points marked in Fig. 1. 



Fig. 1 

Parenthetically, we remark that ( B(U ), P(U)) is called a maximal coupling , because P (B(U) =£ 
P(U)) actually achieves equality in the Coupling Lemma for J = {0,2, 3,. . . As a consequence, 
p( 1 - e p ) is actually the best possible bound in Theorem 1 for the case n = 1. While there is a 
method to construct a maximal coupling for any distributions Q and R [12], even for discrete 
distributions it is not true that ( Q(U ), R(U)) is always a maximal coupling. For an example, 
consider R equal to the Bemoulli(l/2) distribution and Q given by Q{0) = 0(1} = Q{2} = \\ 

here P (R(U) # Q(U )) = |, while the maximum of |i£(/) - Q(J) | is \. Nevertheless, 
( Q(U ), R(U)) does always have a weaker maximal similarity property, namely that it maximises 
the correlation between all pairs of random variables with distributions R and Q (see, for example, 
[24], Theorem 2.5). 

Proof of Theorem 1. Let U l9 . . . , U n be independent uniform random variables over (0, 1). Let B t 
and P t (i = 1 ,...,«) denote the Bernoulli and Poisson distributions both with parameters p i9 and 
denote B^Uf) by X t and P^Uf) by Y t . Clearly, (X i9 Yf), i = 1 ,...,«, are independent random 
vectors. Therefore, has the same distribution as S. Moreover, by a standard result, LY t has the 
P(.;E/? / ) distribution [6, p. 237]. At the same time, LX i ¥= LY t implies X t # Y t for some i in 
{1 Combining these facts with the Coupling Lemma produces 

|P(S e /) - P(J-,Z Pi )\ < p| U {X * 1/}) 

<EP(^¥= ^). 

Applying the argument for n = 1 to each term in the sum completes the proof. 

Often it is not convenient to have the parameter of the approximating Poisson distribution 
equal to E p t . 

Corollary 2. For any J c (0, 1, 2, ... ) and any X > 0, 

|P(S e/) - P(J ; X)\ < \X - E p t \ +(max^ / )E^ / . 
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The corollary follows from the triangle inequality, Theorem 1, and the inequalities 

Lpf < (max/>,)E/>, 

and 

( 3 ) 

To see the truth of (3), we use an argument of Freedman [7]. Suppose that E/?, > X. Let Z x and Z 2 
be independent Poisson random variables with parameters X and E/?, - X. By the Coupling 
Lemma, the left side of (3) is bounded by P (Z x i= Z x + Z 2 ), which in turn is bounded by 

P(Z 2 * 0) < 1 - e ~^p- x \ 

The inequahty (2) now gives (3). A symmetrical argument gives (3) if E/?, < X. 

To close this section we mention that the coupling technique can also produce bounds similar 
to Theorem 1 for dependent events and for multinomial distributions. The interested reader is 
referred to [21], [4], and [18]. 

3. Defining the Poisson Process. The bounds of the previous section are now applied to see 
that the general Poisson process arises from purely qualitative assumptions. But first we show how 
the ideas of that section provide a natural way to introduce the Poisson process in an elementary 
course (i.e., without (ft, J^,P)!). 

Feller [6, p. 159] cites many examples where the Poisson distribution provides an adequate fit 
to experimental data. We shall consider the first of Feller’s examples and construct a mathemati- 
cal model for it. In this example, a radioactive substance emits a-particles; the number TV, of 
emissions reaching a given space up to time t > 0 is recorded. The following assumptions appear 
not unreasonable: 

(1) that {TV,} is a point process on U + , by which we mean that each TV, is a random variable and 
that a graph of TV, versus t is always an increasing step function with jumps of size 1 and TV 0 = 0 
(the assumption that the jumps in the graph are of size 1 corresponds to the idea that no two 
emissions occur at exactly the same time), 

(2) that {TV,} is completely random , by which we mean, informally, that information concerning 
the emissions in one interval of time tells us nothing about the emissions in a disjoint interval of 
time or, formally, that for each integer n and 0 < t x < • • • < t n9 

N h9 N t2 -N h9 ... 9 N tn -N tn _ l 

are independent (lack of complete randomness corresponds to the idea that past information can 
tell us something about future emissions), 

(3) that {TV,} is stationary (in increments ), by which we mean that TV, — N s has the same 
distribution as N v - N u provided t - s = v - u (of course, in the long term, this cannot be true as 
the radioactive substance decays, but it will be approximately true over intervals short compared 
to the half life). 

Theorem 3. If {TV,} is a stationary , completely-random point process on U + 9 then each TV, has a 
Poisson distribution. 

Before proving this theorem, it is instructive to compare Theorem 3 with treatments of the 
Poisson process in textbooks. Firstly, it seems almost universal that in the first treatment of the 
Poisson process, assumption (1) is replaced by a stronger and practically less interpretable 
assumption concerning the probability of two or more emissions in a short time interval. 
Complete randomness is usually assumed, but the simple assumption (3) is usually replaced by the 
assumption that there exists X so that for all t 9 
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J>(N t+h ~N t = l) = \h + o(h) 

as h -> 0. It initially puzzled me as a student that a model for radioactive emissions should be 
formulated thus; perhaps I was not alone. In any event, with these usual assumptions there are 
two ways to show that N t has a Poisson distribution. Either some differential equations are 
formulated and solved (in which case the proof is longer than the proof of Theorem 3) or the 
Poisson approximation to the binomial is used (in which case the proofs structure is much the 
same as that below, but the stronger assumptions do not make the proof very much easier). 
Finally, in revising the manuscript, I discovered that Feller [6, p. 157] sketches a similar proof to 
the one below; it is hoped that the detailed treatment here will convince others that this is the 
natural way to introduce the Poisson process to students. 

Proof. We will define for each n = 1,2,... a random variable S n of the form of S in (1). It is 
then shown that 

(4) ?(N t *S n )^0 

as n -> oo, and also that there exists X such that for each m in {0,1,2,. . . } 

(5) P (S n = m) -> e~ x X m / ml 
as n -> oo along a subsequence of N. The inequality 

|P (N t = m)~ e~ x X m /m\\ < |P(JV, = m) - P(S n = m)\ + \?(S n - m) - e~ x X m /m\\ 

and the Coupling Lemma then finish the proof. 

To define S n we consider the partition & n of (0, t] into n (left-open, right-closed) intervals of 
equal length. For i = 1,. . . ,«, we let A ni be the event that the i th interval of & n has at least one 
emission in it and let S n be the number of A nl ,. . . ,A nn which occur. 

Let D be the random variable which gives the minimum time between emissions in (0, t\ If 
N t # S„, then at least one interval of & n has two or more emissions in it, so that D < l/n. Hence, 
P (N t # S n ) is bounded above by P (D < l/n). Because {N t } is a point process, D > 0. This 
implies that the distribution function of D is 0 at 0 and, since it is also right continuous, 
P (D < l/n) 0 and (4) follows. 

By complete randomness, the events A nl ,. . . ,A n „ are independent. To establish (5), we may 
thus apply Corollary 2 with S = S n and J = ( m }. By stationary, T*(A nl ) = • • • = P (A nn ) = p n , 
say, and thus the bound of Corollary 2 is \np n - X\ + ( np n ) 2 /n . Hence, it suffices to show that 
{ np n } is a bounded sequence, for, in that case, the Bolzano-Weierstrass theorem gives us a 
subsequence along which np n converges. 

The random variable N t is zero if, and only if, each A ni does not occur. Thus, by complete 
randomness and stationarity, 

(6) P(^r = 0) = (1 — Pn)" ■ 

A consequence of this is that if P (N t = 0) were 0, then it would be certain that each A ni would 
occur. But then, since S n < N t for all w, we would have N t = oo, contradicting the assumption that 
{ N t } is a point process. Thus we may take negative logs in (6) to conclude that n{— ln(l — p n )} 
equals the finite number L = —\nP(N t = 0). The inequality x < -ln(l - x) (x < 1) now tells us 
that L is an upper bound for np n . Since clearly np n > 0, the required boundness is now 
established. 

A couple of remarks on this proof seem desirable. There are two methods to avoid the use of 
subsequences in the proof. One way (Feller [5, p. 157]) uses the fact that — n ln(l - p n ) converges 
only if np n converges. The other way would be to define & n to have 2" intervals, in which case it is 
easy to show that np n increases with n. Strictly speaking, we only need the usual binomial 
convergence to Poisson in the proof and not the bound of Corollary 2; but the bound is the easiest 
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way to establish the convergence needed for the next theorem. 

The parameter X introduced in the proof clearly depends on t , and so we denote it as X(t). It is 
now fairly easy to show that X(t) is linear in t. First, the fact that N t has a Poisson X(t) 
distribution implies that 

(7) A(f) = -InP (N(t) = 0) = n\(t/n ) 

from (6). Let i and j be arbitrary integers > 0. Setting t = 1 and n =j in (7) gives X(l) = j\(l/j). 
If we set t = i/j and n = i, the same equation gives X(i/j ) = i\(l/j) 9 and from this and die 
previous equation we deduce that X(t) = A(l), for all rational t. To extend the latter to real t 9 we 
consider the distribution function, F, of the time, T l9 to the first emission. Since {7\ > t) = {N t 
= 0}, (7) gives X(t) = -ln(l - F(t)) and the right continuity of F yields the required extension. 

We now generalise Theorem 3 to the setting of point processes on U p (p ^ 1) and drop the 
assumption of stationary (with the requisite topological facts it is straightforward to replace U p 
by a locally compact, second countable, Hausdorff space as in [ 11 , p. 48]). Since U p has a more 
complicated order structure, we need to describe our process differently and an easy way to do 
this is by specifying an appropriate sample space, Q. 

The samples we are trying to model are random sets of points in U p (in the previous example, 
each point is an emission). Two sensible restrictions are that each set co is countable and that each 
co has finite intersection with every bounded set. Accordingly, we take £2 to be the set of all such co 
and for any bounded set B define N(B, co) to be the cardinality of co n B (in the previous example 
N t (o)) = N(( 0, t] 9 co)). For B a bounded Borel set, we want N(B) : co -> N(B , co) to be a random 
variable (so that { N t } is a point process on U + in the example). Accordingly, we call a probability 
measure P a point process on U p if P has domain a a- algebra i^on Q, which is large enough so that 
for any B QU P , N(B ) is a random variable (hereafter, all subsets of U p are assumed bounded 
Borel). Informally, P is the law that dictates how to, choose the points of the process. The point 
process is completely random if N(Bf ) 9 . . . ,N(B n ) are independent whenever B l9 ... ,B n are disjoint 
subsets of U p (that, in the previous example, this definition is no stronger than the one given is a 
standard measure theoretic exercise). The point process is called a Poisson process if each N(B) 
has a Poisson distribution. In this case, since for x ^ U p , N{x} only takes the values 0 or 1, the 
parameter of the distribution of N{x} must always be 0. Thus, a Poisson process has no fixed 
atoms , meaning that for each x e R p , P(A{x) = 0) = 1. This qualitative property appears quite 
reasonable in many applications; an exact time point at which there was a positive probability of 
observing a radioactive emission would be very special indeed. The remarkable fact along these 
lines is due essentially to Prekopa [ 20 ]. 

Theorem 4. A completely random point process is a Poisson process if 9 and only if, it has no fixed 
atoms. 

A proof of Theorem 4 is presented by detailing some necessary changes to that of Theorem 3; 
the result seems much simpler than existing proofs of Theorem 4. Since we now have defined an 
underlying probability space, we also show how to eliminate the nonmathematical language in the 
previous proof. 

Proof. We have already noted that it is necessary that P has no fixed atoms. 

We fix a subset B of R p and replace every occurrence of N t in the previous proof by N(B). The 
major change is in the definition of the partition which will no longer have n sets in it, nor will 
these sets be intervals of equal size. Let x e B and let D x (oi) be the minimum of | y - x\ for the 
finitely many y in co n B. The assumption of no fixed atoms means that each D x has a distribution 
function which is 0 at 0. Using right continuity again, we may thus choose a strictly positive 
d x ^.l/n such that P (D x < d x ) < l/n. Thus, if C x is the open ball with centre x and radius d x9 

P (N(C X ) > 1) < l/n. 
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The Heine-Borel Theorem then allows us to choose a finite subcover of B from the sets {C x 
By taking suitable intersections [9, p. 32], we can use this subcover to form a partition 
& n = {B nl ,. . . ,B nk ] of B (where of course k actually depends on n). We now let A ni = { N{B ni ) 
> 1} (/ = 1 and S', be S' in (1) except that now and hereafter the sum is over i e {1 

To demonstrate ( 4 ), we proceed as before after making the formal definition that Z>(w) is the 
minimum of | y - z| such that y and z are both in w n 5, and noting that N(B , w) # »S w ((o) 
implies there exist G w such that w z and co 7 both lie in the same set of 3P n . To demonstrate 

(5), we note that even in the absence of stationary, it still suffices to show that {Lp ni } n is 
bounded. This is because each B { in & n is contained in some C x and so the maximum of p nl ,. . . ,p nk 
is bounded above by l/n. Finally, we note that P (N(B) = 0) now becomes the product of 
(1 - p n i),. . . ,(1 - p nk ) and by the last statement this exceeds (1 - l/n) n > 0 for n > 2. Thus, we 
still get the finite number -lnP(AT(5) = 0) as an upper bound for Lp ni . 

4. Acknowledgement. The author is grateful to J. G. Kupka for many valuable conversations 
on this material. 


References 

1. D. J. Aldous, Tail behaviour of birth-and-death and stochastically monotone processes, Z. Wahrsch. Verw. 
Gebiete, 62 (1983) 361-374. 

2. D. J. Aldous, Random walks on finite groups and rapidly mixing Markov chains, Seminaire de Probability 
XVII, Lecture Notes in Mathematics, 986, Springer, 1983. 

3. P. Billingsley, Probability and Measure, Wiley, New York, 1979. 

4 . T. C. Brown, Some Poisson approximations, Ann. Probab., 11 (1983) 726-744. 

5 . W. Doeblin, Expose de la theorie des chaines simples constantes de Markov a un nombre fini d’etats, Rev. 
Math. Union Interbalkanique, 2 (1938) 77-105. 

6. W. Feller, An Introduction to Probability Theory and Its Applications, vol. 1, Wiley, New York, 1968. 

7 . D. Freedman, The Poisson approximation for dependent events, Ann. Probab., 2 (1974) 256-269. 

8. D. Griffeath, Coupling methods for Markov processes, in Studies in Probability and Ergodic Theory (Editor 
G. Rota), Academic Press, 1978. 

9 . P. Halmos, Measure Theory, Springer- Verlag, New York, 1974. 

10 . R. A. Holley, Remarks on the FKG inequalities, Comm. Math. Phys., 34 (1974) 227-231. 

11. O. Kallenberg, Random Measures, Academic Press, London, 1976. 

12 . T. Kaijser, On a new contraction condition for random systems with complete connections, Rev. Roumaine 
Math. Pures Appl., 26 (1981) 1075-1117. 

13 . L. Le Cam, An approximation theorem for the Poisson binomial distribution, Pacific J. Math., 10 (1960) 
1181-1197. 

14 . P. Levy, Processus Stochastiques et Mouvement Brownien, Gauthier- Villars, Paris, 1948. 

15 . T. Lindvall, A probabilistic proof of Blackwell’s renewal theorem, Ann. Probab., 5 (1977) 482-485. 

16. H. W. Lotwick and B. W. Silverman, Convergence of spatial birth and death processes, Math. Proc. 
Cambridge Philos. Soc., 90 (1981) 155-165. 

17 . K. Matthes, J. Kerstan and J. Mecke, Infinitely Divisible Point Processes, Wiley, Chichester, 1978. 

18 . D. R. McDonald, On the Poisson approximation to the multinomial distribution, Canad. J. Statist., 8 (1980) 
116-118. 

19 . J. W. Pitman, Uniform rates of convergence for Markov chain transition probabilities, Z. Wahrsch. Verw. 
Gebiete, 29 (1974) 193-227. 

20 . A Prekopa, On secondary processes generated by a random point distribution of Poisson type, Ann. Univ. 
Sci. Budapest. Eotvos Sect. Math., 1 (1958) 153-170. 

21 . R. J. Serfling, A general Poisson approximation theorem, Ann. Probab., 3 (1975) 726-731. 

22. D. W. Stroock and S. R. S. Varadhan, Multidimensional Diffusion Processes, Springer- Verlag, New York, 
1979. 

23. L. N. Vasershtein, Markov processes on countable produce spaces describing large systems of automato, 
Problemy Peredaci Informacii, 3 (1969) 64-72. 

24 . W. Whitt, Bivariate distributions with given marginals, Ann. Statist., 4 (1976) 1280-1289. 



